Reduction Optimization in Heterogeneous Cluster Environments

Authors

  • Pangfeng Liu
  • Da-Wei Wang
Abstract

A network of workstations (NOW) is a cost-effective alternative to massively parallel supercomputers. As commercially available off-the-shelf processors become cheaper and faster, it is now possible to build a cluster that provides high computing power within a limited budget. However, a cluster may consist of different types of processors, and this heterogeneity complicates the design of efficient collective communication protocols. For example, it is a very hard combinatorial problem to find an optimal reduction schedule for such heterogeneous clusters. Nevertheless, we show that a simple technique called slowest-node-first (SNF) is very effective in designing efficient reduction protocols for heterogeneous clusters. First, we show that SNF is actually an approximation algorithm with competitive ratio two. In addition, we show that SNF does give the optimal reduction time when the cluster consists of two types of processors, and the ratio of communication speed between them is at least two.
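
The abstract does not spell out the cost model, so the following is only a minimal sketch under an assumed one, not the authors' algorithm: each node has a fixed send time, a parent receives from its children one at a time in a given order, and a child may send only after it has received from all of its own children. The function name `reduction_time`, the dictionaries `children` and `send_time`, and the four-node cluster are all hypothetical. The sketch evaluates two hand-built reduction trees to illustrate why letting the slowest nodes send first, as leaves, can shorten the reduction.

```python
def reduction_time(root, children, send_time):
    """Completion time of a reduction tree under the ASSUMED model:
    a send from child to parent takes send_time[child], and a parent
    receives from its children one at a time, in the listed order."""
    def ready(node):
        free_at = 0  # time at which this node has finished its last receive
        for child in children.get(node, []):
            # The send starts once the child holds its partial result
            # and the parent has finished the previous receive.
            start = max(free_at, ready(child))
            free_at = start + send_time[child]
        return free_at
    return ready(root)


# Hypothetical cluster: two fast nodes (a, b) and two slow nodes (c, d).
send_time = {"a": 1, "b": 1, "c": 10, "d": 10}

# Slow nodes are leaves and send at time 0; fast node b forwards afterwards.
slow_first = {"a": ["c", "b"], "b": ["d"]}

# Slow node c sits in the middle of the tree and must wait for d.
slow_inner = {"a": ["b", "c"], "c": ["d"]}

print(reduction_time("a", slow_first, send_time))  # 11
print(reduction_time("a", slow_inner, send_time))  # 20
```

In this toy instance the two slow sends overlap when the slow nodes are leaves, whereas placing a slow node inside the tree serializes them. The sketch only scores a given tree; it does not search for an optimal schedule.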

Similar resources

Adaptive Dynamic Data Placement Algorithm for Hadoop in Heterogeneous Environments

The Hadoop MapReduce framework is an important distributed processing model for large-scale data-intensive applications. The current Hadoop and the existing Hadoop distributed file system’s rack-aware data placement strategy in MapReduce in the homogeneous Hadoop cluster assume that each node in a cluster has the same computing capacity and that the same workload is assigned to each node. Default Hadoop d...

Collective Communication Optimization with Dynamic Programming in Heterogeneous Cluster Environments

Network of workstation (NOW) is a cost-effective alternative to massively parallel supercomputers. However, the heterogeneity complicates the design of efficient collective communication protocols. Nevertheless, we demonstrate that a complex reduction scheduling problem can be transformed into a tree path length optimization problem and we develop dynamic programming techniques to solve this pr...

Parallel asynchronous particle swarm optimization.

The high computational cost of complex engineering optimization problems has motivated the development of parallel optimization algorithms. A recent example is the parallel particle swarm optimization (PSO) algorithm, which is valuable due to its global search capabilities. Unfortunately, because existing parallel implementations are synchronous (PSPSO), they do not make efficient use of comput...

Cost Minimization Heuristics for Scheduling Workflows on Heterogeneous Distributed Environments

Many large scale scientific problems require computing power that goes beyond the capabilities of a single machine. The data and compute requirements of these problems demand a high performance computing environment such as a cluster, a grid or a cloud platform in order to be solved in a reasonable amount of time. In order to efficiently execute workflows and utilize the distributed resources i...

Multi-criteria genetic algorithm applied to scheduling in multi-cluster environments

Scheduling and resource allocation to optimize performance criteria in multi-cluster heterogeneous environments is known as an NP-hard problem, not only for the resource heterogeneity, but also for the possibility of applying co-allocation to take advantage of idle resources across clusters. A common practice is to use basic heuristics to attempt to optimize some performance criteria by treatin...

Publication date: 2000